AITopics | stochastic relaxation

As hyper-parameters are ubiquitous and can significantly affect the model performance, hyper-parameter optimization is extremely important in machine learning.

artificial intelligence, machine learning, optimization, (14 more...)

Neural Information Processing Systems

Country:

Asia > China > Hong Kong (0.04)
Asia > Middle East > Jordan (0.04)
Asia > China > Beijing > Beijing (0.04)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Search (0.94)

Add feedback

b7500454af92cf3934eb1cc2d59abbdf-Supplemental-Conference.pdf

Neural Information Processing SystemsOct-9-2025, 05:39:51 GMT

artificial intelligence, machine learning, val, (14 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.95)

Add feedback

Gradient is All You Need?

Riedl, Konstantin, Klock, Timo, Geldhauser, Carina, Fornasier, Massimo

arXiv.org Artificial IntelligenceJun-16-2023

In this paper we provide a novel analytical perspective on the theoretical understanding of gradient-based learning algorithms by interpreting consensus-based optimization (CBO), a recently proposed multi-particle derivative-free optimization method, as a stochastic relaxation of gradient descent. Remarkably, we observe that through communication of the particles, CBO exhibits a stochastic gradient descent (SGD)-like behavior despite solely relying on evaluations of the objective function. The fundamental value of such link between CBO and SGD lies in the fact that CBO is provably globally convergent to global minimizers for ample classes of nonsmooth and nonconvex objective functions, hence, on the one side, offering a novel explanation for the success of stochastic relaxations of gradient descent. On the other side, contrary to the conventional wisdom for which zero-order methods ought to be inefficient or not to possess generalization abilities, our results unveil an intrinsic gradient descent nature of such heuristics. This viewpoint furthermore complements previous insights into the working principles of CBO, which describe the dynamics in the mean-field limit through a nonlinear nonlocal partial differential equation that allows to alleviate complexities of the nonconvex function landscape. Our proofs leverage a completely nonsmooth analysis, which combines a novel quantitative version of the Laplace principle (log-sum-exp trick) and the minimizing movement scheme (proximal iteration). In doing so, we furnish useful and precise insights that explain how stochastic perturbations of gradient descent overcome energy barriers and reach deep levels of nonconvex functions. Instructive numerical illustrations support the provided theoretical insights.

artificial intelligence, machine learning, optimization, (18 more...)

arXiv.org Artificial Intelligence

2306.09778

Country:

Europe > Germany (0.28)
North America > United States > Michigan (0.14)
Europe > Switzerland (0.14)
(2 more...)

Genre: Research Report (0.83)

Industry:

Energy > Oil & Gas > Upstream (0.45)
Information Technology > Security & Privacy (0.45)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (1.00)

Add feedback

Non-local Optimization: Imposing Structure on Optimization Problems by Relaxation

Müller, Nils, Glasmachers, Tobias

arXiv.org Machine LearningNov-11-2020

In stochastic optimization, particularly in evolutionary computation and reinforcement learning, the optimization of a function $f: \Omega \to \mathbb{R}$ is often addressed through optimizing a so-called relaxation $\theta \in \Theta \mapsto \mathbb{E}_\theta(f)$ of $f$, where $\Theta$ resembles the parameters of a family of probability measures on $\Omega$. We investigate the structure of such relaxations by means of measure theory and Fourier analysis, enabling us to shed light on the success of many stochastic optimization methods. The main structural traits we derive, and that allow fast and reliable optimization of relaxations, are the resemblance of optimal values of $f$, Lipschitzness of gradients, and convexity.

probability measure, relaxation, stochastic relaxation, (13 more...)

arXiv.org Machine Learning

2011.06064

Country:

Europe > Germany (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)

Genre: Research Report (0.40)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Model Based Image Compression and Adaptive Data Representation by Interacting Filter Banks

Okamoto, Toshiaki, Kawato, Mitsuo, Inui, Toshio, Miyake, Sei

Neural Information Processing SystemsDec-31-1990

To achieve high-rate image data compression while maintainig a high quality reconstructed image, a good image model and an efficient way to represent the specific data of each image must be introduced. Based on the physiological knowledge of multi - channel characteristics and inhibitory interactions between them in the human visual system, a mathematically coherent parallel architecture for image data compression which utilizes the Markov random field Image model and interactions between a vast number of filter banks, is proposed.

compression, data compression, image data compression, (13 more...)

Neural Information Processing Systems

Country:

North America > United States (0.05)
Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.05)
Asia > Japan > Honshū > Kansai > Kyoto Prefecture > Kyoto (0.05)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.36)

Add feedback

Model Based Image Compression and Adaptive Data Representation by Interacting Filter Banks

Okamoto, Toshiaki, Kawato, Mitsuo, Inui, Toshio, Miyake, Sei

Neural Information Processing SystemsDec-31-1990

To achieve high-rate image data compression while maintainig a high quality reconstructed image, a good image model and an efficient way to represent the specific data of each image must be introduced. Based on the physiological knowledge of multi - channel characteristics and inhibitory interactions between them in the human visual system, a mathematically coherent parallel architecture for image data compression which utilizes the Markov random field Image model and interactions between a vast number of filter banks, is proposed.

compression, data compression, image data compression, (13 more...)

Neural Information Processing Systems

Country:

North America > United States (0.05)
Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.05)
Asia > Japan > Honshū > Kansai > Kyoto Prefecture > Kyoto (0.05)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.36)

Add feedback

Model Based Image Compression and Adaptive Data Representation by Interacting Filter Banks

Okamoto, Toshiaki, Kawato, Mitsuo, Inui, Toshio, Miyake, Sei

Neural Information Processing SystemsDec-31-1990

To achieve high-rate image data compression while maintainig a high quality reconstructed image, a good image model and an efficient way to represent the specific data of each image must be introduced. Based on the physiological knowledge of multi - channel characteristics and inhibitory interactions between them in the human visual system, a mathematically coherent parallel architecture for image data compression which utilizes the Markov random field Image model and interactions between a vast number of filter banks, is proposed.

compression, data compression, image data compression, (13 more...)

Neural Information Processing Systems

Country:

North America > United States (0.05)
Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.05)
Asia > Japan > Honshū > Kansai > Kyoto Prefecture > Kyoto (0.05)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.36)

Add feedback

Filters

Collaborating Authors

stochastic relaxation

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

b7500454af92cf3934eb1cc2d59abbdf-Paper-Conference.pdf

b7500454af92cf3934eb1cc2d59abbdf-Supplemental-Conference.pdf

Efficient Hyper-parameter Optimization with Cubic Regularization

b7500454af92cf3934eb1cc2d59abbdf-Supplemental-Conference.pdf

Gradient is All You Need?

Non-local Optimization: Imposing Structure on Optimization Problems by Relaxation

Model Based Image Compression and Adaptive Data Representation by Interacting Filter Banks

Model Based Image Compression and Adaptive Data Representation by Interacting Filter Banks

Model Based Image Compression and Adaptive Data Representation by Interacting Filter Banks